for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0322 [herein designated as UNQ283 (DNA48336-1309)] (SEQ ID NO:394) and the derived protein sequence 
for PR0322. It is understood that UNQ283 (DNA48336-1309) in fact encodes PR0322, and that SEQ ID 
NO:394 is a representation of the sequence based on sequencing techniques known in the art. 

The entire nucleotide sequence of UNQ283 (DNA48336-1309) is shown in Figure 158 (SEQ ID 
NO:394). Clone UNQ283 (DNA48336-1309) contains a single open reading frame with an apparent translational 
initiation site at nucleotide positions 166-168 and ending at the stop codon at nucleotide positions 946-948 
(Figure 158). The predicted polypeptide precursor is 260 amino acids long (Figure 159). The full-length 
PR0322 protein shown in Figure 159 has an estimated molecular weight of about 28,028 daltons and a pI of 
about 7.87. Clone UNQ283 (DNA48336-1309) has been deposited with ATCC and is assigned ATCC deposit 
no. 209669. 
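
The length of a predicted precursor follows directly from the positions of the initiation codon and the stop codon. The following is a minimal sketch in Python of that arithmetic; the function name orf_length_aa is illustrative only and is not part of the disclosure.

```python
def orf_length_aa(start_codon_first_nt: int, stop_codon_first_nt: int) -> int:
    """Return the number of amino acids encoded by an open reading frame.

    Both arguments are 1-based nucleotide positions of the FIRST base of
    the initiation codon and of the stop codon. The stop codon itself is
    not translated, so it is excluded from the count.
    """
    coding_nt = stop_codon_first_nt - start_codon_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_nt // 3


if __name__ == "__main__":
    # Initiation site at 166-168, stop codon at 946-948 (UNQ283).
    print(orf_length_aa(166, 946))
```

With the coordinates given for UNQ283 (166 and 946), the coding region spans 780 nucleotides, i.e. 260 codons.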

Regarding the amino acid sequence of Figure 159, a potential N-glycosylation site is at amino acid 110 
of SEQ ID NO:395. The serine proteases, trypsin family, histidine active site is identified at amino acids 
69 through 74 of SEQ ID NO:395 and the consensus sequence is identified at amino acids 207 through 217 of 
SEQ ID NO:395. The kringle domain proteins motif is identified at amino acids 205 through 217 of SEQ ID 
NO:395. The putative signal peptide is encoded at about amino acids 1-23. 

Analysis of the amino acid sequence of the full-length PR0322 polypeptide suggests that portions of 
it possess significant homology to neuropsin and other serine proteases, thereby indicating that PR0322 is a 
novel serine protease related to neuropsin. 

EXAMPLE 63: Isolation of cDNA Clones Encoding Human PR0526 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 
above, wherein the consensus sequence obtained is herein designated DNA39626. Based on the DNA39626 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0526. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-TGGCTGCCCTGCAGTACCTCTACC-3' (SEQ ID NO:401); 
reverse PCR primer 5'-CGGTGCAGGTGATTGGCAGGTAGG-3' (SEQ ID NO:402). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA39626 consensus 
sequence which had the following nucleotide sequence: 
hybridization probe 

5'-AGGCACTGCCTGATGACACCTrGGGGGACCTGGGGAACGTGACAG-3' (SEQ ID NO:403). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0526 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver tissue (LIB228). 
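
The library screen described above can be mimicked computationally when a candidate sequence is available. The following is a minimal sketch in Python, assuming exact primer matches on a single template strand; the function names and the example template are hypothetical and serve only to illustrate how a primer pair delimits an amplicon. Actual PCR tolerates mismatches and depends on reaction conditions that this sketch ignores.

```python
from typing import Optional

FORWARD = "TGGCTGCCCTGCAGTACCTCTACC"  # SEQ ID NO:401
REVERSE = "CGGTGCAGGTGATTGGCAGGTAGG"  # SEQ ID NO:402


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def in_silico_pcr(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the predicted amplicon, or None if either primer site is absent.

    The forward primer must match the template as written; the reverse
    primer anneals to the opposite strand, so its reverse complement is
    searched for downstream of the forward primer site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


if __name__ == "__main__":
    # Hypothetical template: both primer sites flanking an arbitrary spacer.
    template = "AAAA" + FORWARD + "ACGT" * 5 + reverse_complement(REVERSE) + "TTTT"
    amplicon = in_silico_pcr(template, FORWARD, REVERSE)
    print(len(amplicon) if amplicon else "no product")
```

A library would score as positive in this simplified model only when both primer sites are found in the expected orientation.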

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0526 [herein designated as UNQ330 (DNA44184-1319)] (SEQ ID NO:399) and the derived protein sequence 
for PR0526. 

The entire nucleotide sequence of UNQ330 (DNA44184-1319) is shown in Figure 160 (SEQ ID 
NO:399). Clone UNQ330 (DNA44184-1319) contains a single open reading frame with an apparent translational 
initiation site at nucleotide positions 514-516 and ending at the stop codon at nucleotide positions 1933-1935 
(Figure 160). The predicted polypeptide precursor is 473 amino acids long (Figure 161). The full-length 
PR0526 protein shown in Figure 161 has an estimated molecular weight of about 50,708 daltons and a pI of 
about 9.28. Clone UNQ330 (DNA44184-1319) was deposited with the ATCC on March 26, 1998. It is 
understood that the clone contains the actual sequence, whereas the sequences presented herein are representative 
based on current sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0526 polypeptide suggests that portions of 
it possess significant homology to the leucine-rich repeat proteins including ALS, SLIT, carboxypeptidase and 
platelet glycoprotein V, thereby indicating that PR0526 is a novel protein which is involved in protein-protein 
interactions. 

Still analyzing SEQ ID NO:400, the signal peptide sequence is at about amino acids 1-26. A leucine 
zipper pattern is at about amino acids 135-156. A glycosaminoglycan attachment is at about amino acids 436- 
439. N-glycosylation sites are at about amino acids 82-85, 179-182, 237-240 and 423-426. A von Willebrand 
factor (VWF) type C domain(s) is found at about amino acids 411-425. The skilled artisan can understand 
which nucleotides correspond to these amino acids based on the sequences provided herein. 
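
The correspondence between amino acid positions and nucleotide positions follows from the position of the initiation codon. The following is a minimal sketch in Python, assuming 1-based coordinates and an uninterrupted open reading frame beginning at nucleotide 514 as stated above for UNQ330; the function name aa_range_to_nt is illustrative only.

```python
from typing import Tuple


def aa_range_to_nt(orf_start: int, aa_first: int, aa_last: int) -> Tuple[int, int]:
    """Map an inclusive amino acid range to an inclusive nucleotide range.

    orf_start is the 1-based position of the first base of the initiation
    codon. Amino acid n is encoded by the three nucleotides beginning at
    orf_start + 3 * (n - 1).
    """
    if aa_first < 1 or aa_last < aa_first:
        raise ValueError("invalid amino acid range")
    nt_first = orf_start + 3 * (aa_first - 1)
    nt_last = orf_start + 3 * aa_last - 1
    return nt_first, nt_last


if __name__ == "__main__":
    ORF_START = 514  # initiation site of UNQ330 (DNA44184-1319)
    print(aa_range_to_nt(ORF_START, 1, 26))     # signal peptide
    print(aa_range_to_nt(ORF_START, 135, 156))  # leucine zipper pattern
```

Under these assumptions the signal peptide at about amino acids 1-26 corresponds to nucleotides 514-591, and the leucine zipper pattern at about amino acids 135-156 corresponds to nucleotides 916-981.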

EXAMPLE 64: Isolation of cDNA Clones Encoding Human PR0531 

An ECD database was searched and an expressed sequence tag (EST) from LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA was identified which showed homology to protocadherin 3. Based on this 
sequence, a search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in 
Enzymology 266:460-480 (1996)) as a comparison of the ECD protein sequences to a 6 frame translation of the 
EST sequence. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did 
not encode known proteins were clustered and assembled into consensus DNA sequences with the program 
"phrap" (Phil Green, University of Washington, Seattle, Washington). 
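
A 6 frame translation reads a nucleotide sequence in three reading frames on each strand so that a protein query can be compared against every possible reading of an EST. The following is a minimal sketch in Python of that step only, assuming the standard genetic code; it is not the BLAST implementation, and the function names are illustrative.

```python
from typing import Dict

# Standard genetic code, codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate from the first base; '*' marks a stop, 'X' an unknown codon."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(seq: str) -> Dict[str, str]:
    """Return translations for frames +1, +2, +3, -1, -2 and -3."""
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    for frame, peptide in six_frame_translation("ATGGCCTAA").items():
        print(frame, peptide)
```

Each of the six translations would then be compared against the protein sequences, with only hits at or above the chosen score threshold carried forward to assembly.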

A consensus DNA sequence was assembled relative to other EST sequences using phrap. Based on the 
consensus sequence obtained, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence 
for PR0531. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CTGAGAACGCGCCTGAAACTGTG-3' (SEQ ID NO:406); 
reverse PCR primer 5'-AGCGTTGTCATTGACATCGGCG-3' (SEQ ID NO:407). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA sequence 
which had the following nucleotide sequence: 
hybridization probe 

5'-TTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA-3' (SEQ ID NO:408). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0531 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue (LIB153). The cDNA libraries 
used to isolate the cDNA clones were constructed by standard methods using commercially available reagents 
such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI site, 
linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into a suitable cloning vector (such as pRK5B or pRK5D; pRK5B is a precursor 
of pRK5D that does not contain the SfiI site; see Holmes et al., Science, 253:1278-1280 (1991)) in the unique 
XhoI and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0531 [herein designated as UNQ332 (DNA48314-1320)] (SEQ ID NO:404) and the derived protein sequence 
for PR0531. 

The entire representative nucleotide sequence of UNQ332 (DNA48314-1320) is shown in Figure 162 
(SEQ ID NO:404). It is understood that the actual sequence is that within the clone deposited with the ATCC 
as DNA48314-1320. Clone UNQ332 (DNA48314-1320) contains a single open reading frame with an apparent 
translational initiation site at nucleotide positions 171-173 and ending at the stop codon at nucleotide positions 
2565-2567 (Figure 162). The predicted polypeptide precursor is 789 amino acids long (Figure 163). The 
full-length PR0531 protein shown in Figure 163 has an estimated molecular weight of about 87,552 daltons and a 
pI of about 4.84. Clone UNQ332 (DNA48314-1320) was deposited with the ATCC on March 26, 1998. 

Analysis of the amino acid sequence of the full-length PR0531 polypeptide suggests that portions of 
it possess significant homology to protocadherin 3. Moreover, PR0531 is found in the brain, like other 
protocadherins, thereby indicating that PR0531 is a novel member of the cadherin superfamily. 

Still analyzing the amino acid sequence of SEQ ID NO:405, the cadherin extracellular repeated domain 
signature is found at about amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 of SEQ ID NO:405. 
An ATP/GTP-binding site motif A (P-loop) is found at about amino acids 285-292 of SEQ ID NO:405. 
N-glycosylation sites are found at least at about amino acids 567-570, 786-790, 418-421 and 336-339 of SEQ ID 
NO:405. The signal peptide is at about amino acids 1-26, and the transmembrane domain is at about amino acids 
685-712 of SEQ ID NO:405. 



EXAMPLE 65: Isolation of cDNA Clones Encoding Human PR0534 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 
above, wherein the consensus sequence obtained is herein designated DNA43038. Based on the DNA43038 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0534. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-GACAGAGGGAGAAGTGGGGGAATC-3' (SEQ ID NO:411); 
reverse PCR primer 5'-CCACATGTTCCTGCTCTTGTCCTGG-3' (SEQ ID NO:412). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA43038 